README for data folder

This folder contains four data sets used in estimation and which are called by the codes in the 
Model folder. It also contains subfolders with necessary codes to build the panel G1kmMiRC_V7.mat and
transportation cost estimates DataTransCost08212015.mat (see subfolders README files for their specific documentation)

%%%  AgregateCS_10222014.mat:
Contains municipality-year level data. Variables from this data set used in estimation:
 - munic:        Municipality identifier
 - year:         Year
 - cs_corn_area: Municipality corn area from 2006 Agricultural Census
 - cs_soy_area:  Municipality soy area from 2006 Agricultural Census
 - meso_reg:     Identifier for Mesoregion (IBGE grouping of municipalities with similar characteristics) 
 
%%%  Prices_05212014.mat:
Contains yearly times series of prices. Variables from this data set used in estimation:
 - year:       Year
 - sb1brl_lag: Lagged Sugar # 11 in Brazilian reais 
 - c1brl_lag:  Lagged corn price (CBT) in Brazilian reais
 - s1brl_lag:  Lagged soy price (CBT) in Brazilian reais
 - e:          Exchange rate (BRL/US) from the Brazilian Central Bank
 
%%%  G1kmMiRC_V7.mat:
Contains micro level data of field characteristics compiled from several sources. Includes panel of sugarcane
land use decisions from CANASAT. NB: It also includes other variables which are not used in the final version of the article.
 - FID_Grid:            Identifier of field (cross-section dimension)
 - S2003 to S2013:      Panel of sugarcane land use information from 2003 to 2013. 
                        See FieldAge.m in Codes folder on how to translate and interpret the dataset.
 - munic:               Municipality identifier
 - SRD2004 to SRD20013: Sugarcane road distance. On road distance (in meters) from field to closest sugarcane field.
 - Urban:               Dummy urban = 1
 - Lake:                Dummy water = 1
 - SugarTransportCostR: Sugar transportation cost from field to port in Brazilian reais.
 - ucs:                 Dummy Protected area = 1
 - mb_uso_detalhe:      MapBiomas (Colecao 4.1, 2004) land use classification. See MapBiomas/mapbiomas_class.pdf.
 - mb_bioma:            = 1 (Atlantic Forest), = 2 (Cerrado), = 3 (Amazon)
 - Altitude             Altitude (meters)
 - prec_growth:         Mean precipitation over growth season (November to April)
 - gaez_aecol_suc:      Agroecological potential yields (high input) for Sugarcane from FAO GAEZ
 - gaez_aecol_mze:      Agroecological potential yields (high input) for Maize from FAO GAEZ
 - gaez_aecol_soy:      Agroecological potential yields (high input) for Soybeans from FAO GAEZ

%%%  DataTransCost08212015.mat:
Contains data for transportation cost model estimation.
 - effective_dist:     Effective on the road distance between pair of origin-destination computed using
                       Data/TransCost/TransCostEsalqLogDestinations.py
 - esalq_cost:         Transportation cost per tonne of sugar from ESALq LOG
 - euclidian_dist:     Euclidian distance (in meters) between pair of origin-destination 
 
